
    Backpropagating through Markov Logic Networks

    We integrate Markov logic networks with deep learning architectures operating on high-dimensional and noisy feature inputs. Instead of relaxing the discrete components into smooth functions, we propose an approach that allows us to backpropagate through standard statistical relational learning components using perturbation-based differentiation. The resulting hybrid models are shown to outperform models relying solely on deep-learning-based function fitting. We find that noise perturbations are required for the proposed hybrid models to learn robustly from the training data.
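
    The abstract does not spell out the mechanism, but perturbation-based differentiation is typically realized by smoothing a discrete solver with random input noise and estimating gradients from the perturbed solutions (in the spirit of perturbed optimizers). Below is a minimal, self-contained Python sketch under that assumption: the one-hot argmax is a hypothetical stand-in for MAP inference over an MLN, and Gaussian noise with Stein's identity supplies the gradient estimate, which also echoes the abstract's finding that noise perturbations are what make learning possible.

    import numpy as np

    def map_solver(scores):
        """Toy stand-in for discrete MAP inference (e.g. over a grounded MLN)."""
        y = np.zeros_like(scores)
        y[np.argmax(scores)] = 1.0
        return y

    def perturbed_solver(scores, sigma=0.5, n_samples=2000, seed=0):
        """Smooth the solver with Gaussian input noise. By Stein's identity the
        same samples estimate the Jacobian d E[y] / d scores = E[y z^T] / sigma,
        which is what lets gradients flow back through the discrete component."""
        rng = np.random.default_rng(seed)
        y_sum = np.zeros_like(scores)
        jac_sum = np.zeros((scores.size, scores.size))
        for _ in range(n_samples):
            z = rng.normal(size=scores.shape)
            y = map_solver(scores + sigma * z)
            y_sum += y
            jac_sum += np.outer(y, z) / sigma
        return y_sum / n_samples, jac_sum / n_samples

    scores = np.array([1.0, 2.0, 1.5])      # outputs of an upstream neural net
    y_smooth, jac = perturbed_solver(scores)
    print("smoothed solution:", y_smooth)   # soft version of the one-hot argmax
    print("estimated Jacobian:\n", jac)     # plug into the chain rule for backprop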

    Learning Edge Representations via Low-Rank Asymmetric Projections

    We propose a new method for embedding graphs while preserving directed edge information. Learning such continuous-space vector representations (or embeddings) of nodes in a graph is an important first step for using network information (from social networks, user-item graphs, knowledge bases, etc.) in many machine learning tasks. Unlike previous work, we (1) explicitly model an edge as a function of node embeddings, and (2) propose a novel objective, the "graph likelihood", which contrasts information from sampled random walks with non-existent edges. Individually, both of these contributions improve the learned representations, especially when there are memory constraints on the total size of the embeddings. When combined, our contributions enable us to significantly improve on the state of the art by learning more concise representations that better preserve the graph structure. We evaluate our method on a variety of link-prediction tasks including social networks, collaboration networks, and protein interactions, showing that our proposed method learns representations with error reductions of up to 76% and 55% on directed and undirected graphs, respectively. In addition, we show that the representations learned by our method are quite space-efficient, producing embeddings which have higher structure-preserving accuracy but are 10 times smaller.
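
    The exact architecture is not given in the abstract; the NumPy sketch below illustrates just the two stated ideas, with made-up dimensions and random data: an edge modeled as an asymmetric, low-rank bilinear function of node embeddings, and a "graph likelihood" that contrasts pairs co-occurring on random walks against sampled non-edges.

    import numpy as np

    rng = np.random.default_rng(0)
    n_nodes, d, rank = 6, 8, 2

    # Trainable parameters: node embeddings Y and a low-rank asymmetric
    # projection M = L @ R (M is not symmetric, so score(u,v) != score(v,u)).
    Y = rng.normal(scale=0.1, size=(n_nodes, d))
    L = rng.normal(scale=0.1, size=(d, rank))
    R = rng.normal(scale=0.1, size=(rank, d))

    def edge_score(u, v):
        """Directed edge score: sigmoid of a bilinear form in the embeddings."""
        return 1.0 / (1.0 + np.exp(-(Y[u] @ L @ R @ Y[v])))

    def graph_log_likelihood(pos_pairs, neg_pairs, eps=1e-9):
        """Contrast random-walk co-occurrences against sampled non-edges."""
        ll = sum(np.log(edge_score(u, v) + eps) for u, v in pos_pairs)
        ll += sum(np.log(1.0 - edge_score(u, v) + eps) for u, v in neg_pairs)
        return ll

    pos = [(0, 1), (1, 2), (2, 3)]   # e.g. pairs sampled from random walks
    neg = [(3, 0), (4, 5)]           # sampled non-existent edges
    print("graph log-likelihood:", graph_log_likelihood(pos, neg))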

    Learning Discrete Structures for Graph Neural Networks

    Graph neural networks (GNNs) are a popular class of machine learning models whose major advantage is their ability to incorporate a sparse and discrete dependency structure between data points. Unfortunately, GNNs can only be used when such a graph structure is available. In practice, however, real-world graphs are often noisy and incomplete or might not be available at all. With this work, we propose to jointly learn the graph structure and the parameters of graph convolutional networks (GCNs) by approximately solving a bilevel program that learns a discrete probability distribution on the edges of the graph. This allows one to apply GCNs not only in scenarios where the given graph is incomplete or corrupted but also in those where a graph is not available. We conduct a series of experiments that analyze the behavior of the proposed method and demonstrate that it outperforms related methods by a significant margin.
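
    As a rough illustration of the core idea (not the paper's bilevel algorithm itself), the sketch below keeps one Bernoulli parameter per potential edge, samples discrete graphs from that distribution, and pushes node features through a standard GCN layer. In the actual method the edge distribution is optimized in the outer level of the bilevel program while the GCN weights are fit in the inner level; here both are simply random.

    import numpy as np

    rng = np.random.default_rng(0)
    n, d_in, d_out = 5, 4, 3

    theta = rng.uniform(size=(n, n))    # Bernoulli parameter per edge (i, j)
    X = rng.normal(size=(n, d_in))      # node features
    W = rng.normal(size=(d_in, d_out))  # GCN weights

    def sample_graph(theta, rng):
        """Draw a discrete graph A ~ Bernoulli(theta), symmetric, no self-loops."""
        A = (rng.uniform(size=theta.shape) < theta).astype(float)
        A = np.triu(A, 1)
        return A + A.T

    def gcn_layer(A, X, W):
        """One graph-convolution layer with the usual normalized adjacency."""
        A_hat = A + np.eye(len(A))               # add self-loops
        D_inv_sqrt = np.diag(1.0 / np.sqrt(A_hat.sum(1)))
        return np.maximum(D_inv_sqrt @ A_hat @ D_inv_sqrt @ X @ W, 0.0)  # ReLU

    # The expected loss over graphs is estimated by sampling, which is what
    # makes the discrete structure learnable (e.g. via score-function gradients).
    outs = [gcn_layer(sample_graph(theta, rng), X, W) for _ in range(4)]
    print("mean output over sampled graphs:\n", np.mean(outs, axis=0))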

    LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching

    Obtaining large pre-trained models that can be fine-tuned to new tasks with limited annotated samples has remained an open challenge for medical imaging data. While pre-trained deep networks on ImageNet and vision-language foundation models trained on web-scale data are prevailing approaches, their effectiveness on medical tasks is limited due to the significant domain shift between natural and medical images. To bridge this gap, we introduce LVM-Med, the first family of deep networks trained on large-scale medical datasets. We have collected approximately 1.3 million medical images from 55 publicly available datasets, covering a large number of organs and modalities such as CT, MRI, X-ray, and ultrasound. We benchmark several state-of-the-art self-supervised algorithms on this dataset and propose a novel self-supervised contrastive learning algorithm using a graph-matching formulation. The proposed approach makes three contributions: (i) it integrates prior pair-wise image similarity metrics based on local and global information; (ii) it captures the structural constraints of feature embeddings through a loss function constructed via a combinatorial graph-matching objective; and (iii) it can be trained efficiently end-to-end using modern gradient-estimation techniques for black-box solvers. We thoroughly evaluate the proposed LVM-Med on 15 downstream medical tasks ranging from segmentation and classification to object detection, in both in-distribution and out-of-distribution settings. LVM-Med empirically outperforms a number of state-of-the-art supervised, self-supervised, and foundation models. For challenging tasks such as Brain Tumor Classification or Diabetic Retinopathy Grading, LVM-Med improves on previous vision-language models trained on 1 billion masks by 6-7% while using only a ResNet-50.
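
    The following simplified sketch reduces the paper's second-order (quadratic) graph matching to plain first-order linear assignment, using SciPy's Hungarian solver on a cosine-similarity matrix between two augmented views of the same batch; the embeddings, batch size, and noise level are invented, and the gradient estimation for the black-box solver is omitted.

    import numpy as np
    from scipy.optimize import linear_sum_assignment

    rng = np.random.default_rng(0)
    k, d = 4, 16  # k images per batch, d-dimensional embeddings

    # Stand-ins for encoder outputs on two augmented views of the same images;
    # the real method also mixes local, patch-level similarity into the cost.
    z1 = rng.normal(size=(k, d))
    z2 = z1 + 0.1 * rng.normal(size=(k, d))

    def cosine(a, b):
        a = a / np.linalg.norm(a, axis=1, keepdims=True)
        b = b / np.linalg.norm(b, axis=1, keepdims=True)
        return a @ b.T

    # Matching between the two views: high similarity -> low assignment cost.
    sim = cosine(z1, z2)
    rows, cols = linear_sum_assignment(-sim)     # maximize total similarity
    print("matched pairs:", list(zip(rows, cols)))  # ideally (i, i) for all i
    print("matching accuracy:", np.mean(rows == cols))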

    Facets of Distribution Identities in Probabilistic Team Semantics

    We study probabilistic team semantics, a semantic framework that allows the study of logical and probabilistic dependencies simultaneously. We examine and classify the expressive power of logical formalisms arising from different probabilistic atoms, such as conditional independence and different variants of marginal distribution equivalences. We also relate the framework to the first-order theory of the reals and apply our methods to the open question of the complexity of the implication problem for conditional independence.
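
    For orientation, here is a standard formulation of one such atom from the probabilistic team semantics literature (notation may differ from the paper): a probabilistic team is a distribution X over variable assignments, and the marginal identity atom compares the marginals the team induces.

    % A probabilistic team X assigns each assignment s a weight X(s) >= 0
    % with \sum_s X(s) = 1. The marginal identity atom then reads:
    \[
      X \models x \approx y
      \quad\iff\quad
      \forall a :\;
      \sum_{s\,:\,s(x) = a} X(s) \;=\; \sum_{s\,:\,s(y) = a} X(s),
    \]
    % i.e. x and y are identically distributed under X. Conditional
    % independence atoms are defined analogously via products of marginals.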

    IT Risk Management with Markov Logic Networks

    We present a solution for modeling the dependencies of an IT infrastructure and determining the availability of its components and services using Markov logic networks (MLNs). MLNs combine first-order logic and probability in a single representation and are well suited to modeling dependencies and threats. We identify different kinds of dependencies and show how they can be translated into an MLN. The MLN infrastructure model allows us to use marginal inference to predict the availability of IT infrastructure components and services. We demonstrate that our solution is well suited to supporting IT risk management by analyzing the impact of threats and comparing risk mitigation efforts.
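
    As a toy illustration of how marginal inference answers such availability queries (hypothetical components and weights, exact inference by enumerating worlds, which is only feasible at this tiny scale):

    import itertools
    from math import exp

    # Toy grounded MLN for an IT infrastructure: boolean availability
    # variables plus weighted formulas encoding soft dependency rules.
    names = ["server", "db", "webservice"]

    def weighted_formulas(w):
        """Each entry: (weight, truth value of the formula in world w)."""
        return [
            (2.0, w["server"]),                       # server is usually up
            (1.5, (not w["server"]) or w["db"]),      # db depends on server
            (1.5, (not w["db"]) or w["webservice"]),  # service depends on db
        ]

    def world_weight(w):
        """MLN semantics: exp(sum of weights of satisfied formulas)."""
        return exp(sum(wt for wt, sat in weighted_formulas(w) if sat))

    worlds = [dict(zip(names, bits))
              for bits in itertools.product([False, True], repeat=len(names))]
    Z = sum(world_weight(w) for w in worlds)  # partition function

    # Marginal inference: availability probability of each component/service.
    for v in names:
        p = sum(world_weight(w) for w in worlds if w[v]) / Z
        print(f"P({v} available) = {p:.3f}")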

    Probabilistic Optimization of Semantic Process Model Matching

    Business process models are increasingly used by companies, often yielding repositories of several thousand models. These models are of great value for business analyses such as service identification or process standardization. A problem, though, is that many of these analyses require the pairwise comparison of process models, which is hardly feasible to do manually given an extensive number of models. While the computation of similarity between a pair of process models has been intensively studied in recent years, there is a notable gap in automatically matching the activities of two process models. In this paper, we develop an approach based on semantic techniques and probabilistic optimization. We evaluate our approach using a sample of admission processes from different universities.
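
    A minimal sketch of the general recipe rather than the paper's actual model: a crude word-overlap score stands in for the semantic similarity, the activity labels are invented, and the probabilistic optimization is reduced to a maximum-probability one-to-one assignment.

    import numpy as np
    from scipy.optimize import linear_sum_assignment

    # Hypothetical activity labels from two university admission process models.
    model_a = ["receive application", "check documents", "send acceptance letter"]
    model_b = ["application received", "verify documents", "mail admission letter"]

    def jaccard(label1, label2):
        """Crude stand-in for semantic label similarity: word overlap."""
        s1, s2 = set(label1.split()), set(label2.split())
        return len(s1 & s2) / len(s1 | s2)

    # Interpret similarities as match probabilities; maximizing the product of
    # probabilities over a matching = assignment on negative log-probabilities.
    eps = 1e-6
    P = np.array([[jaccard(a, b) + eps for b in model_b] for a in model_a])
    rows, cols = linear_sum_assignment(-np.log(P))

    for i, j in zip(rows, cols):
        print(f"{model_a[i]!r}  <->  {model_b[j]!r}   (p={P[i, j]:.2f})")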